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Abstract. The implication problem for the class of embedded depen¬ 
dencies is undecidable. However, this does not imply lackness of a proof 
procedure as exemplified by the chase algorithm. In this paper we present 
a complete axiomatization of embedded dependencies that is based on 
the chase and uses inclusion dependencies and implicit existential quan¬ 
tification in the intermediate steps of deductions. 
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1 Introduction 

Embedded dependencies generalize the concept database dependencies within 
the framework of first-order logic. Their implication is undecidable but however 
recursively enumerable, thus enabling complete axiomatizations. A standard ex¬ 
ample of such a proof procedure is the chase that was invented in the late 1970s 
|1I2) , and then soon extended to equality and tuple generating dependencies [3] . 
In this paper we present an axiomatization for the class of embedded dependen¬ 
cies that simulates the chase at the logical level using inclusion dependencies. In 
particular, completeness of the rules is obtained by constructing deductions in 
which all the intermediate steps are inclusion dependencies, except for the first 
and the last step. These inclusion dependencies consist of attributes of which 
some are new, i.e., such that they are not allowed to appear at any earlier stage 
of the deduction. 

As a background example, consider the combined class of functional and in¬ 
clusion dependencies. It is well known that the corresponding implication prob¬ 
lem is undecidable, lacking hence finite axiomatization |4l5j . In these situations, 
one strategy has been to search for axiomatizations within a more general class of 
dependencies, and partly for this reason many different dependency notions were 
introduced in the 1980s. For instance, a textbook on dependency theory from 
1991 considers more than 80 different dependency classes [B]. In [7] Mitchell pro¬ 
posed another strategy by presenting an axiomatization of functional and inclu¬ 
sion dependencies using a notion of new attributes which are to be thought of as 


implicitly existentially quantified. In this paper we take an analogous approach, 
and present an axiomatization for embedded dependencies where new attributes 
correspond to new values obtained from an associated chasing sequence. These 
attributes are implicitly existentially quantified in the sense of team semantics, 
that is, a semantic framework that has teams, i.e., sets of assignments, as its un¬ 
derlying concept [S] . Team semantics is compositionally applicable to logics that 
extend first-order logic with various database dependencies [snn]. In this set¬ 
ting, inclusion logic, i.e., first-order logic with additional inclusion dependencies, 
captures positive greatest fixed-point logic and hence all PTIME recognizable 
classes of finite, ordered models |ll | 12 | 13j . Therefore, inclusion dependencies 
with new attributes can be thought of as existentially quantified inclusion logic 
formulae which in turn translate into greatest fixed-point logic. Moreover, all 
existentially quantified dependencies that appear in deductions translate into 
existential second-order logic. This may in part enable succinct intermediate 
steps in contrast to axiomatic systems that simulate the chase by composing 
first-order definable dependencies. 

The methods described in this paper generalize the axiomatization of condi¬ 
tional independence and inclusion dependencies presented in M- It is also worth 
noting that extending relations with new attributes reminds of algebraic depen¬ 
dencies, that are, typed embedded dependencies defined in algebraic terms. The 
complete axiomatization of algebraic dependencies presented in [15] involves also 
an extension schema that introduces new copies of attributes. 

2 Preliminaries 

For two sets A and B, we write AB to denote their union, and for two sequences 
ab, we write ab to denote their concatenation. For a sequence a = (ai,..., a„) 
and a mapping /, we write f{a) for (/(ai),...,/(a„)). We denote by id the 
identity function and by pr^ the function that maps a sequence to its ith pro¬ 
jection. For a function / and A C Dom(/), we write /|^ for the restriction of / 
to A, and for a set of mappings F, we write for {/|^ : / £ F}. 

We start by fixing two countably infinite sets Val and Att, the first denoting 
possible values of relations and the second attributes. For notational convenience, 
we will assume that Val = Att. For R C Att, a tuple over i? is a mapping R Val, 
and a relation over i? is a set of tuples over R. We may sometimes write r[R\ 
to denote that r is a relation over R. Values of a relation r over R are denoted 
by Val(r), i.e., Val(r) := {t{A) : t £ r,A £ R}. Let / be a valuation, i.e., a 
mapping Val ^ Val. Then for a tuple t, we write f{t) := f ot, and for a relation 
f{f) •= {f{t) ■ t G ''■}• A valuation / embeds a relation r (a tuple t) to r' if 
/(r) C r' (fit) £ r). Since we are usually interested only valuations of a relation, 
we say that / : Val(r) —^ Val is a valuation on r. For a valuation / on r, we say 
that g is an extension of / to another relation r' if (7 is a valuation on r' such 
that it agrees with / on values of Val(r) n Val(r'). 

Embedded dependencies (ed’s) can be written using first-order logic in the 
following way. 
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Definition 1 (Embedded dependency). Embedded dependency is a first- 
order sentence of the form 

Vxi,.. ...,Xn)^3zi. ..3zkip{yi,.. .,ym)) 

where {zi,...,Zk} = {yi,... ,ym}\{xi,... ,Xn} and 

— (j) is a (possibly empty) conjunction of relational atoms using all of the vari¬ 
ables Xi, ... ,Xn; 

— if is a conjunction of relational and equality atoms using all of the variables 
zi,. ■ ■,Zk; 

— there are no equality atoms in if involving existentially quantified variables. 

If at most one relation symbol occurs in an ed, then we say that the ed is 
unirelational, and otherwise it is multirelational. An ed is called typed if there 
is an assignment of variables to column positions such that variables in relation 
atoms occur only in their assigned position, and each equality atom involves 
a pair of variables assigned to the same position. Otherwise we say that an 
ed is untyped. If if contains only one atom, then we say that the ed is single- 
head, and otherwise it is multi-head. A single-head ed where if is an equality is 
called an equality generating dependency (egd). If '0 is a conjunction of relational 
atoms, then the ed is called a tuple generating dependeney (tgd). For notational 
simplicity, we restrict attention to unirelational ed’s. It is easy to se that any ed 
is equivalent to a set of tgd’s and egd’s, and hence we restrict attention to ed’s 
that belong to either of these subclasses. 

The following alternative tableau presentation for egd’s and tgd’s are used 
in this paper. 

Definition 2. Let T and T' be finite relations over R, and x,y S Val(T). Then 
(r, X = y) and (T, T') are an egd and a tgd over R, respectively, with the below 
satisfaction relation for a relation r over S A R: 

— r \= {T,x = y) for all valuations f such that f(T) C r|^, it holds that 
fix) = f{y). 

— r \= (T, r') for all valuations f on T such that /(T) C r\R, there is an 
extension g of f to T' such that g{T') C r|^. 

Sometimes we write to denote that cr is a dependency over R. If T or T' 
is a singleton, then we may omit the set braces in the notation, e.g., write (T, t) 
instead of (T, {t}). 

We also extend valuations to dependencies. For an egd a = {T,x = y) we 
write Val((T) = Val(r), and for a tgd r = {T,T') we write Val(tT) = Val(T) U 
Val(T'). Moreover, if / is a valuation, then /(tr) = {f{T),f{x) = f{y)) and 
fir) = if{T),f{r)). 

Example 1. Consider the relation r and the tgd’s cti := {{t, t'}, {it}) and cr 2 := 
{{t,t'}, {v,v'}) obtained from Fig. [Ifl We notice that there are two valuations 

^ In a tableau presentation of a dependency a, the distinct values of a are sometimes 
denoted by blank cells. 
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on that embed to r, namely / := {(x, 0 ), (j/, 1 ), (z, 2 )} and g := 

{(x, 3), (y, 0), (z, 1)}. Then r \= ai since / and g embed u into r, witnessed 
by tuples S 2 and S 3 , respectively. We also notice that r ^ (T 2 since, although 
/ U {(a, 3)} embeds {x, x'} into r, no extension of g does the same. 
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Next we define inclusion dependencies which are examples of possibly un¬ 
typed tgd’s. 

Definition 3 (Inclusion dependency). Let ^ 1 ,..., An and Bi,..., Bn be 

(not necessarily distinct) tuples of attributes. Then Ai... An ^ Bi... Bn is an 
inclusion dependency (ind) over R = {Ai,Bi : i = 1,..., n} with the following 
semantic rule for a relation r over S A R: 

r \= Ai... An Bi ... Bn Vs € r3s' G rVi = 1,..., n : s(Ai) = s'{Bi). 

The axiomatization presented in the next section involves inclusion dependencies 
that introduce new attributes. These attributes are here interpreted as existen¬ 
tially quantified in lax team semantics sense [9]: 

r 1= 3A(j) r[f/A] \= (f for some f : r ^ T’(Val) \ {0}, (1) 

where r[f/A] := {t{x/A) : x G f{A)} and t{x/A) is the mapping that agrees with 
t everywhere except that it maps A to x. Interestingly, inclusion logic formulae 
with this concept of existential quantification can be characterized with positive 
greatest fixed-point logic formulae (see Theorem 15 in [TT]1. 

3 Axiomatization 

In this section we present an axiomatization for the class of all embedded de¬ 
pendencies. The axiomatization contains an identity rule and three rules for the 
chase. We also involve conjunction in the language and therefore incorporate its 
usual introduction and elimination rules in the definition. Regarding the equal¬ 
ities that appear in the rules, note that both AB C A A and AB f- BB indicate 
that the values of A and B coincide in each row. Therefore, we use A = B to 
denote ind’s of either form. For a tgd (an egd) cr, we say that x G Val(cr) is 
distinct if it appears at most once as a value in a. Namely, 
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- for a tgd a = (T, T') [i?], x is distinct if for €TUT' and A, B € R, ii 

t{A) = X = then t = t' and B = B'; 

- for an egd a = {T,y = z)[R], x is distinct if a; ^ {u^z} and for all t,t' & T 
and A, B G R, if t{A) = x = t'{B), then t = t' and B = B'. 

Lastly, note that in the following rules we assume that values can appear as 
attributes and vice versa. 

Definition 4. In addition to the below rules we adopt the usual introduction 
and elimination rules for conjunction. In the last three rules, we assume that A 
is a sequence listing the attributes of R. 


EE Equality Exchange: 

if A = B Aa, then r. 

where a is an ind and r is obtained from a by replacing any number of 
occurrences of A by B and any number of occurrences of B by A. 

CS Chase Start: 

{T*,id)[RS]A /\ t{A) C A 

teT 

where T = T*|^, S' = Val(T) consists of new attributes, and R consists of 
distinct values. 

CR Chase Rule: 

tgd: if (T, /\ f ° ^ then /\ .f ° ^ 

teT t'GT' 

egd: if (T, x = y)[R] A f\ f o t{A) C A, then f{x) = f(y), 

tGT 

where tgd: f is a valuation that it is 1-1 on Val(T') \ Val(r), and f{x) is a 
new attribute for x € Val(r') \ Val(T). 

CT Chase Termination: 

tgd: i/(T*, id)[i?S] A uot'{A)CA, then (T,T')[R], 

t'&T' 

egd: if {T* ,id) [RS] Ax = y, then (T, x = y) [i?], 

where T = T*\^, S = Val(T), and Val(T*|g) consists of distinct values. 
Moreover, tgd: u is a mapping Val(r') —Att that is the identity on Val(r) n 
Val(T'), and egd: x,y G Val(T). 

For a dependency a over R, we let Att((T) := R, and for a set of dependencies 
E, we let Att(Il’) := Ucrei: Att(CT) . 

Definition 5. A deduction from S is a sequence (cti, ..., cr„) such that: 

1. Each Ui is either an element of E, an instance of [CS], or follows from one 
or more formulae of {ui,..., by one of the rules presented above. 
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2. For each A e Att(cri), if A is new in Ui, then A ^ Att(Z’ U {cti, ..., at-i}), 
and otherwise A € Att(Z’ U {cti, ..., tTi_i}). 

We say that a is provable from S, written E \- a, if there is a deduction 
(ui,..., CTri) from S with a = an and such that no attributes in a are new 

%Tl , . . . O'11. 

We will also use the following rules that are derivable from [EE]: 


ES Equality Symmetry: 


ET Equality Transitivity: 


\i A = B, then B = A. 


A A = B A B = C, then A = C. 

One may find the chase rules slightly convoluted at first sight. However, the ideas 
behind the rules are relatively simple as illustrated in the following examples. 

Example 2 (Chase Start). Let aQ := ({to, ti}, {uo})[i?5'] be as in EigurejU 
for R := {A, B, C} and S := {cc, y, z}. Then 
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r := (To A xyz C ABC Axy C BC 

is an instance of [CS]. Here x,y,z are interpreted either as values or as new 
attributes. By the latter we intuitively mean that any relation r[ABC] can be 
extended to some r'[ABCxyz] such that r' ^ r. For instance, one can define 
r' := q(r) where q is the following SPJR query 

ABC [X] {j^xyz[cfxy—Bc{.Pxyz /^ 

where a refers to (S)election, tt to (P)rojection, txi to (J)oin, and p to (R)ename 
operator. Then q{r) is a relation over RS such that its restriction to xyz lists 
all abc for which there exist s,s' € r such that s{ABC) = abc and s'{BC) = ab. 
Let (Ti = ({to,ii}, {ui})[R] be as in Figured Now, 

r \= ai q{r) \= zx C AC. 

Hence proving E ax reduces to showing that E U {r} \= zx C AC. 
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Example 3 (Chase Rule). Assume 


tT 2 A xyz C ABC Axy C BC 


( 2 ) 


where (J 2 = ({to ^ 1 }) {^ 2 , U 3 })[i?] is as in Fig. [51 for R := {A,B,C}. Then, 
interpreting / as id, one can derive with one application of [CR] 


(3) 


zv C AC Avz C AC 


from ©■ Note that in {|3|) v is interpreted as a ticw attribute, and the idea is that 
any relation r\R\ satisfying ([5]) and with v ^ R can be extended to a relation 
r'[R U {u}] satisfying ([3]) by introducing suitable values for v. 

Example 4 (Chase Termination). Assume 


(4) 


aQ A zx C AC 


where ctq = ({to, ti}, {'WoDl^'S'] is as in Fig. [51 for R := {A,B,C} and S := 
(x, y, z}. Then, letting u = id, one can derive cti as in Fig. [5] from (|3|) with one 
application of [CT]. 

4 Soundness Theorem 

In this section we show that the axiomatization presented in the previous section 
is sound. First note that the next lemma follows from the definitions of egd’s, 
tgd’s and ind’s. 

Lemma 1. Let a be a dependency over R, and let r and r' be relations over 
supersets of R and with r Ifl — ’'"'Ir- Then r |= cr <;=> r' |= tr. 

Then we prove the following lemma which implies soundness of the axioms. For 
attribute sets R, R' with R C R' and a relation r over R, we say that a relation r' 
over R' is an extension of r to R' if r'\R = r. Recall from equation |T] that exactly 
such extensions are used in the existential quantification of lax team semantics. 

Lemma 2. Let r be a relation over Att(A’) such that r \= S, and let (cri,..., (T„) 
be a deduction from S. Then there exists an extension r' of r to Att(A' U 
(cti, ..., On\) such that r' \= E\J {ui,..., an}- 

Proof. We prove the claim by induction on n. We denote by the set Att(A' U 
{(Ti,..., CT„}). Assuming the claim for n —1, we first find an extension r„_i of r to 
Rn-i such that r„_i ^ A U {cti, ..., an-i}- If cr„ is obtained by an application 
of a conjunction or some ind rule, then it is easy to see that we may choose 
fn ■= Tn-i- Hence, it suffices to consider the cases where (T„ is obtained by using 
one of the chase rules. Due to Lemma [TJ it suffices to find an extension r„ of 
r„_i to Rn such that r„ |= an- In the following cases, A denotes a sequence 
listing the attributes of R C Rn-i- 
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Case [CS]. Assume that ct„ is obtained by [CS] and is of the form 

{T*,id)[RS]A /\ t{A) C A 

teT 

where T = T*|^, S = Val(T) consists of new attributes and R of distinct values. 
Let r„ := r„_i [xi r be an extension of r„_i to Rn = Rn-iS, where 

r := {ft, : ft, is a valuation on T such that ft(T) C 

We claim that r„ \= Un- Consider the first conjunct of and let ft be a valuation 
on T* such that h{T*) C Then h\g is is a valuation on T such that 

hlT) C r„|^ = i.e., h\g = tols for some to G r„. Since R consists of 

distinct values and thus R fl Dom(ft) = 0, we may define ft' as an extension 
of ft with A i-> to (A), for A G R. Then ft'l^g = tol^g G ^n|_RS) s-od therefore 
rn h {T*Ad)[RS]. 

Consider then t{A) C A, for t G T, and let to G rn- By the definition, 
tolg = ft for some valuation ft on T such that h{T) C r„|^, and hence we obtain 
that to o t{A) = ho t{A) = ti(A) for some ti € r„. Therefore, r„ |= t{A) C A. 

Case [CR]. Assume that cr„ is of the form (i) At'gT' / ° ^'i^) G A or (ii) 
f{x) = f{y), and is obtained by [CR] from 

(i) {T,r)[R]A;\^^^fot{A)CA, 

(ii) (T, X = y)[R] A / o t(A) C A, 

where in case (ii) / is a valuation on T U T' such that it is 1-1 on S := Val(r') \ 
Val(T) and f(x) is a new attribute for x G S. Let s G r„_i. Since r„_i \= 
/\tGT f ° ^(A) G A, we first obtain that s o f{T) C 

(i) Since r„_i \= (T,T')[R] we find a mapping g : S -G Val such that ft(T') C 
r„_i 1^, for ft = gU (s o /). Since / is 1-1 on S, we can now define r„ as the 
relation obtained from r„_i by extending each s G r„_i with f(x) i—?■ g{x) 
for X G S. Then for each s G rn, s o f{T') C r„|^, and hence we obtain 
that r„ ^ At'GT' / ° ^'(A) G A. 

(ii) It suffices to show that r„_i \= f{x) = f{y). Since so f{x) = so f[y) by 
r„_i 1= {T,x = J/)[R], this follows immediately. 

Case [CT]. Assume that ct„ is of the form (i) (T, T')[i?] or (ii) (T, x = i/)[i?] 
and is obtained by [CT] from 

(i) (T*, id)[RA] A At'er' ° ^^(A) G A, where m is a mapping Val(T') ^ Att 
that is the identity on Val(T) D Val(r'), 

(ii) (T*, id) [RA] Ax = y, where x,y G Val(T). 

Moreover, in both cases T = T*]^, S = Val(T), and Val(r*|g) consists of distinct 
values. It suffices to show that r„_i ^ ct„, so let ft be a valuation on T such that 
h{T) C Since Val(T*|g) consists of disctinct values, ft can be extended 



to a valuation h' on T* such that h'{T*) C Since r„_i |= (T*,id)[-RS*], 

there is an extension h" of h' to attributes in R such that h"\^g S 
Hence, we obtain that h\g G r„_i|g. Let then s € r„_i be such that it agrees 
with h on S. 

(i) Since r„_i |= At'eT' ^ ° ^'i^) — obtain that s o u{T') C 

Moreover, we notice that s o u = h on Val(T) n Val(T'). 

(ii) Since r„_i |= x = y, we obtain that s{x) = s{y). Then h{x) = hijj) since 
x,y € S. 

Hence, in both cases we obtain that r„_i \= an- This concludes the [CT] case 
and the proof. □ 

Using the previous lemma, soundness of the rules follows. 

Theorem 1. Let E U {cr} he a finite set of egd’s and tgd’s over R. Then E \= a 
if EL a. 

Proof. Let r be a relation such that r |= U, and assume that (cti, ... ,an) is a 
deduction from E where a = an contains no attributes that appear as new in 
iji,..., (T„. If i?' := Att(T'U{cri,..., an})-, then by Lemma[2]we find an extension 
r' of r|^ to R' such that r' a. Then using Lemma [T] we obtain that r |= cr. □ 

5 Chase Revisited 

In this section we define the chase for the class of egd’s and tgd’s. The chase 
algorithm was generalized to typed egd’s and tgd’s in |3], and here we present 
the chase using notation similar to that in m- First let us assume, for nota- 
tional convenience, that there is a total, well-founded order < on the set Val, 
e.g., xi < X 2 < X 3 < ... for Val = {xi, X 2 , X 3 ,...}. Let U U {cr} be a set of 
egd’s and tgd’s over R. A chasing sequence of a over U is a (possibly infinite) 
sequence ctq, cti, ..., cr„,... where cto = a, and ct„+i is obtained from cr„, with 
T pr;^(cr„), according to either of the following rules. 

Let r G U be of the form (S, x = y), and suppose that there is a valuation / 
on S such that f{S) C T but /(x) 7 ^ f{y)- Then r (and /) can be applied to cr„ 
as follows: 

— egd rule: Let cr„+i := g(cr„) where g : Val —Val is the identity everywhere 
except that it maps f{y) to /(x) if /(x) < f{y), and /(x) to f{y) if f{y) < 

f{x)- 

Let r G U be of the form {S,S'), and suppose that there is a valuation / on 
S such that f{S) C T, but there exists no extension /' of / to S' such that 
f{S') C T. Then r can be applied to cr„ as follows: 

— tgd rule: List all /i,..., /„ that have the above property, and for each fi 
choose a distinct extension to S', i.e., an extension /' to S' such that each 
variable in Val (5”') \ Val (S') is assigned a distinct new value greater than any 
value in Val(cro) U ... U Val(cr„). Moreover, no new value is assigned by two 
fi, f'j where i ^ j. Then we let cr„+i : (T U /{(S') U ... U /^(.(S'), pr 2 (cr„)). 
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Construction of a chasing sequence is restricted with the following two con¬ 
ditions: 

(i) Whenever an egd is applied, it is applied repeatedly until it is no longer 
applicable. 

(ii) No dependency is starved, i.e., each dependency that is applicable infinitely 
many times is applied infinitely many times. 

Let {S, a) = (To, CTi) • ■ • be a chasing sequence of a over E. Due to the possibility 
of applying egd’s, a chasing sequence may not be monotone with respect to C. 
Hence, depending on whether cr is a tgd or an egd, we define 

- egd: chase(N',CT) := {T^,x = y), 

- tgd: chase(N',CT) := (r^,T^), 

where T* := {u : 3mVn > m(u € prj((T„))} and x = y is pr 2 ((T„) for n G N such 
that pr 2 ((T„) = pr 2 (CTm) for all m > n. Note that “newer” values introduced 
by the tgd rule are always greater than the “older” ones, and values may only 
be replaced with smaller ones. Hence, no value can change infinitely often, and 
therefore chase(N', a) is always well defined and non-empty. 

We also associate each chasing sequence with the following descending val¬ 
uations pn, for n > 0. We let po = id, Pn+i = 5 o Pn if <yn+i is obtained by an 
application of the egd rule where (t„+i = g((T„), and Pn+i = id o p„ otherwise. 
We then define p{x) = lim„_,.oo p„(x), i.e., p{x) = Pn{x) if n G N such that 
Pm{x) = Pn{x) for all m > n. Then we obtain that 

OO 

chase(N',CT) = |J p((J„). 

n—0 


A dependency r is trivial if 

— T is of the form (T, x = x), or 

— r is of the form (T, T') and there is a valuation / on T' such that / is the 
identity on Val(r) fl Val(r') and f{T') C T. 

It is well-known that the chase algorithm captures unrestricted implication 
of dependencies. The proof of the following proposition is hence located in Ap¬ 
pendix. 

Proposition 1. Let S U {a} be a set of egd’s and tgd’s over R. Then the fol¬ 
lowing are equivalent: 

(i) A 1= cr, _ 

(ii) there is a chasing sequence {E,a) = ao,ai,... of a over E such that 
chase{E, a) is trivial, 

(Hi) there is a chasing sequence (E, u) = ctq, (Ti, ... of a over E such that an is 
trivial, for some n. 
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6 Completeness Theorem 

In this section we show that the rules presented in Definition |4] are complete for 
the implication problem of embedded dependencies. Let us first illustrate the 
use of the axioms in the following simple example. 

Example 5. Consider the implication problem {ct, cr'} |= r where a,a',T are 
illustrated in Fig.[3l e.g., tr = (T, t) where T consists of the top two rows of a and 
t is the bottom row. Note that a and t are embedded multivalued dependencies 
of the form A B\C and A B\CD, respectively, and a' is a functional 
dependency of the form C ^ D. It is easy to see that the implication holds, and 
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this can be also verified by a chasing sequence to,ti,T 2 of r over {cr, tr'} where 
T 2 is trivial. In the chasing sequence, tq = r and ri is the result of applying 
a to Tq. For this, note that there exists two valuations on T that embed T to 
pr]^(ro) but has no extension that embeds t into pr]^(To). These valuations are 
the identity and the function / that swaps the values of the top and bottom row 
of T. Then ti is obtained by adding to pr^(To) id*(t) and f*{t) where id* and 
/* are distinct extensions of id and / to t, e.g., id* = id also on d 2 and /* maps 
^2 to d^. Also, T 2 is the result of applying a' to ti two times, i.e., T 2 is obtained 
from Ti by replacing d^ with do and d 2 with di. Clearly T 2 is trivial, and hence 
we obtain the claim by Proposition [TJ 
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This procedure can now be simulated with our axioms as follows. First, with 
one application of [CS] we derive 

(T, id)[i?5'] A aobocodo Q ABCD A aobicidi C ABCD 
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where T = R = {A,B,C,D}, and S = {oq, &o, cq, ci, do, di} is a set 

of values that are interpreted as new attributes. Here t(x) and t'{x), for x € S, 
and A, B, C, D are interpreted as distinct values. (T, is illustrated in Fig. 

O where all the distinct values are hidden. Now with one application of [CR], 
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Fig. 5. {T,id)[RS] 


letting / = id, we derive ao&oCid 2 C ABCD from 

a A aobocodo C ABCD A apdicidi C ABCD (5) 

Note that in this step, d 2 is interpreted as a new attribute. Let then / be the 
valuation that is the identity on ao,6o,^i)di, and otherwise maps ai n- ap, 
cp i-A Cl, and dp M- d2- We notice that /(ap&pCpdp) = op6pCid2 and /(ai^iCpdi) = 
op^icidi. Hence, we may derive with one application of [CR] /(dp) = /(di), i.e., 
d 2 = di from 


a' A /(op&pcpdp) C ABCD A /(oifoicpdi) C ABCD. 

Then we apply [EE] and derive ap&pCidi C ABCD from 

d2 = di A Up6pcid2 C ABCD 

Finally, we may apply [CT] and derive r from (T, id)]^^] A apfepCidi C ABCD. 

The following lemma shows that the above technique extends to all chasing 
sequences. The proof is straightforward and located in Appendix. 

Lemma 3. Let {S,a) = cro,cri,... be a chasing sequence of a over E, where 
E U {a} is a finite set of egd’s and tgd’s over R, let A be a sequence listing 
the attributes of R, let T := pr]^(CT) and R := and let n G N. Then 

there exists a deduction from E, with attributes from i? U Val(Ti), listing 
the following dependencies: 

(i) (T*,id)[i?5'] where T*]^ = T, S = Val(T), and T'*]^ consists of distinct 
values, 

(ii) f{x) = f{y), for each application of {S,x = y) and f to am, for m < n, 
(Hi) t(A) C A, for t e Tm where m <n. 

With the lemma, we can now show completeness. 

Theorem 2. Let E U {a} be a finite set of egd’s and tgd’s over R. Then E ^ 
a ^ E \- a. 
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Proof. Assume that S \= a, and let A be a sequence listing R. Then by Propo¬ 
sition [T] there is a chasing sequence (A, a) = ctq, tri,... of ct over S such that 
is trivial for some n. Let D = (ri,..., r;) be a deduction from S obtained by 
LemmaO and let T := pr]^(tT) and Ti := pr 2 (cri). 

Assume first that a is an egd of the form (T, x = y). Then tT„ is (T„, z = z) 
where z = Pnix) = Pniv)- Now, either pi^i{x) is Pi{x), or the equality pi+i{x) = 
Pi{x) (or its reverse) is listed in D by item (ii). Hence, using repeatedly [ES,ET] 
we may further on derive z = x. Since z = y is derivable analogously, we therefore 
obtain a: = y by [ES,ET]. Then with one application of [CT], we derive (T, x = y) 
from (T*, id)[i?^] A x = y where T*|^ = T. Note that the (T*, id)[i?S'] of the 
correct form is listed in D by item (i) of Lemma |3l 

Assume then that cr is a tgd of the form (T, T'), and let T/ := pr 2 (cri). Then 
Un is {Tn,Tf), and there is a valuation / on Tf such that / is the identity on 
Val(T„) n \/a\{Tf) and f{Tf) C Tn- Let t' € T'. Then pnot' G Tf and by item 
(iii) of Lemma [3] we obtain that / o o t'{A) C A is listed in D. For A £ R,we 
have then two cases : 

— If t'{A) G Val(r') n Val(T), then we first notice that / o o t'{A) is o 
t'{A) since Pn o t'{A) G Val(T))) n Val(T„). Also we notice that the equality 
Pn o t'{A) = t'{A) can be derived analogously to the egd case. 

— If t'{A) G Val(T')\Val(T), then f op^ot'{A) = fot'{A) since by the definition 
of the chase is the identity on Val(T') \ Val(T). 

Now, letting /* be the mapping Val(T') —s- Att which is the identity on Val(T') n 
Val(T) and agrees with / on Val(T') \ Val(T), we can by the previous reasoning 
and using repeatedly [EE] derive f* ot'{A) C A from /op„ot'(A) C A. Finally, 
we can then with one application of [CT] derive (T, T') from 

{T*,id)[RS]A /\ r ot'(A) C A. 

t'GT' 


□ 


7 Typed dependencies 

Consider then the class of typed embedded dependencies. In this setting [CS] and 
[CT] can be replaced with rules that involve only embedded join dependencies 
(ejd’s) and inclusion dependencies. We define ejd’s over tuples of attributes as 
follows. 

Definition 6. Let Ai,..., A„ be tuples of attributes listing i?i,..., Rn, respec¬ 
tively, and let R := Ur=i Then ixi(AdiLi 'Is an embedded join dependency 
with the semantic rule 

- r ]= 00 (Ad'Ll if and only */r|^ = ex ... ex 

The two alternative rules for the chase are now the following. We call a relation 
typed if none of its values appears in two distinct columns. 
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CS* Chase Start*: 


/\AC t{A)A c^(t(A))teT A f\ t{A) C A 

tGT teT 

where T is a typed relation and Val(T) is a set of new attributes. 

CT* Chase Termination*: 

tgd : if A A C t(A)A [x](t(A))tgT A /y uot'{A)CA, then (T, 

ter t'eT' 

egd : if /\ A C t(A)A [xi(t(A))tgT A a; = y, then {T,x = y)[i?], 

teT 

where tgd: u is a mapping Val(T') —Att that is the identity on Val(T') fl 
Val(r'), and egd: x,y G Val(r). 

The first rule is sound for typed dependencies since, for arbitrary r with 
Dom(r) nVal(T) = 0, an instance of [CS*] is satisfied by r cc q{r) where q is the 
SPJR query 

Pti{A)IAA IXl . . . CO Pt„{A)/AA, 

where p is the rename operator and T = {ti,... ,tn}- However, a counter example 
for soundness can be easily constructed for untyped dependencies. If T and r 
are the relations illustrated in Fig.[6l then no extension r' of r to Val(T) satisfies 
At^TKAB)CAB. 
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Fig. 6. 


Soundness of [CT*] is obtained analogously to that of [CT]. Also, com¬ 
pleteness is obtained by deriving exactly in the same way as in the general 
case, Ai'eT' ^ ° ^ tgd case) or x = y (in the egd case) from 

us then write A h* cr if cr is deduced from S in the sense 
of Definition [S] and using rules [EE,CS*,CR,CT*] together with elimination and 
introduction of conjunction. Then we obtain the following theorem. 

Theorem 3. Let S U {a} be a finite set of typed egd’s and tgd’s over R. Then 
E \=a ^ EL* a. 
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Appendix 

Proof. (Proposition [T|) Let S U {cr} be a set of egd’s and tgd’s over R. The 
direction (ii) (Hi) is clear because it suffices to choose tT„ such that all the 
relevant tuples and values remain fixed in am for m > n. We show (i) {ii) 
and (in) => (i). 

(i) => (ii): Assume that there is a chasing sequence (T',cr) = aQ,ai,... of 
a over E such that chase(A', a) is non-trivial. We claim that chase(A', tr) |= A 
and chase(A’, cr) ^ a. Let T„ denote pr^(CT„). Assume first that {S,x = y) £ E 
and assume to the contrary that / is a valuation such that f{S) C chase(A', a) 
but f{x) ^ f{y). Then there exists m € N such that f{S) £ Tn for all n > m, 
contradicting the assumption that no dependency is starved in the chase. 

A ssume that (S', S') £ E, and assume that / is a valuation such that /(S) C 
chase(A',CT), and let m G N be such that /(S) C Tn for all n > m. Then 
there is an extension /' of / to S' such that /'(S') C Tm' for some m' > m, 
where we define Tf := pr 2 (cr„). Note that o /'(S) C Tf for all n > to', 
and hence there exists to" > to' such that Pm" o f'{S) C Tf for all n > to". 
Since /(S) C chase(A’, a), pm" is the identity on /(S), and hence we obtain that 
chase(r, a) h (S, S'). _ 

Finally, we show that chase(A’, a) ^ a. Analogously to the previous case we 
find a valuation pn such that Pn{T) C pr^^ (chase(A', a)). If a is of the form (T, x = 
y), then we obtain that Pn{x) = Pn{y) is pr 2 (chase(A', tr)). Since chase(A’, cr) is 
non-trivial, pn(x) and Pn{y) must be two distinct values. Hence, pn witnesses 
chase(A',CT) ^ {T,x = y). 

Assume then that cr is of the form (T, T'). Then analogously Pn ° T £ 
pr;^(chase(A’, cr)) and o T' = pr 2 (chase(A', ct)) for some n G N. Also note 
that by the construction pn is the identity on Val(T') \ Val(T). Now, if there 
is an extension h of Pn|vai(T) such that h{T') C pr]^(chase(A’, cr)), then 

chase(A', a) is trivial. Hence pn is a witness of chase(A', a) ^ a. 

{in) ^ {i): Let {E, cr) = erg, cti, ... be a chasing sequence of a over A, where 
an is trivial, and let T (or T) denote pr]^(cri) (or pr 2 (CT)). Assume that r \= E, 
and let / be a valuation on T to r. Using the chase construction rules and the 
assumption it is easy to show inductively that for all n there is an extension /„ 
of / to such that 

(i) UTn) C r, 

(ii) fnOpn = fn- 

Assume first that a is of the form (T, x = y), and hence pn{x) = Pn{y). Then by 
the induction claim we obtain that f{x) = f{y). Assume that a is of the form 
(T,T'), and let h be a valuation such that h{Tf) C Tn and h is the identity 
on Val(T„) n Val(T))). Then fnoho pn{T') C r where, by the induction claim, 
fn o h o Pn is f on Val(T) n Val(T'). Hence, we obtain that r ^ cr in both cases. 
This concludes the proof. □ 

Proof (Lemma\^. W.l.o.g. we may assume that no attribute of R appears as a 
value in the chasing sequence, i.e., RCi show the claim by 

induction on n. 
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The base case. First it suffices to deduce by one application of [CS] 

iT*,id)[RS]A /\ t{A) C A 

tGT 

where (T*, id) is of the form described in (i). 

The inductive step. Assuming the claim for n, we next show the claim for 
n + 1. Assume first that (t„+i is obtained from cr„ by using the egd rule for 
{S,x = y) G S over a valuation f on S such that f{S) C T„ and f{x) ^ f{y)- 
Then Tri+i = 9 {Tn) where g is the identity everywhere except that it maps, say 
f{y) to f{x). By the induction assumption, it now suffices to consider (ii) and 
(iii) only in the cases that associate with the construction of cr„+i. 

(ii) The equality f{x) = f{y) can be derived with one application of [CR], since 
/ o s(A) C A, for all s G S, have been deduced by the assumption. 

(iii) Let t G T„+i, and let t' G T„ be such that t = g o t'. li f{y) ^ Val(t'), 
then t{A) C A has been derived by the induction assumption. Otherwise, 
t'{A) = f{y) for some A G R. Now using repeatedly [EE] to f{y) = f{x) and 
t'{A) C A, we obtain t(A) C A. 

Assume then that i7„+i is obtained from (t„ by using the tgd rule for (S, S') G 
S. W.l.o.g. we may assume that there is only one valuation f on S with the 
property that / embeds S to T„, but no extension of / to S' embeds S' to T^. 
Let /' be the distinct extension associated with this step, i.e., /' is an extension 
of / to S' such that each variable in Val(S'') \ Val(S') is assigned a distinct 
new value greater than any value appearing in (Tq ,..., Un. By the induction 
assumption, none of these new values appear in the deduction. Hence, by the 
assumption we may with one application of [CR] from {S, S') A AseS f' ° '®(^) 
deduce As'eS' f' ° ^ where all the new values are interpreted as new 

attributes. Since Tn+i = TnLi f'{S'), this concludes item (iii) and thus the proof. 

□ 
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